AITopics | user-level differential privacy

bin, inequality follow, variance, (13 more...)

Neural Information Processing Systems

Industry: Information Technology > Security & Privacy (0.67)

Technology:

Information Technology > Security & Privacy (0.67)
Information Technology > Artificial Intelligence (0.46)

Matrix Factorization for Practical Continual Mean Estimation Under User-Level Differential Privacy

Kalinin, Nikita P., Najar, Ali, Roth, Valentin, Lampert, Christoph H.

arXiv.org Machine Learning, Feb-2-2026

We study continual mean estimation, where data vectors arrive sequentially and the goal is to maintain accurate estimates of the running mean. We address this problem under user-level differential privacy, which protects each user's entire dataset even when they contribute multiple data points. Previous work on this problem has focused on pure differential privacy. While important, this approach limits applicability, as it leads to overly noisy estimates. In contrast, we analyze the problem under approximate differential privacy, adopting recent advances in the Matrix Factorization mechanism. We introduce a novel factorization specific to mean estimation, which is both efficient and accurate, achieving asymptotically lower mean-squared error bounds in continual mean estimation under user-level differential privacy.
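As a rough illustration of the general Matrix Factorization mechanism mentioned in this abstract, the sketch below writes the running-mean workload as a matrix `A`, factors it as `A = B @ C`, and releases `B @ (C @ x + z)` with Gaussian noise `z`. It is not the factorization proposed in the paper. The function names, the prefix-sum factorization, and the simplifying assumption that each row of `x` comes from a distinct user with L2 norm at most 1 are all illustrative choices, and `sigma` is left uncalibrated rather than derived from a privacy budget.

```python
import numpy as np


def running_mean_workload(n):
    """Workload A with A[t, s] = 1/(t+1) for s <= t, so (A @ x)[t] is the mean of x[0..t]."""
    return np.tril(np.ones((n, n))) / np.arange(1, n + 1)[:, None]


def prefix_sum_factorization(n):
    """One simple (hypothetical) factorization A = B @ C.

    C computes prefix sums; B rescales row t by 1/(t+1).
    """
    B = np.diag(1.0 / np.arange(1, n + 1))
    C = np.tril(np.ones((n, n)))
    return B, C


def mf_release(x, B, C, sigma, rng):
    """Return B @ (C @ x + z) for Gaussian noise z.

    Assumption: every row of x has L2 norm <= 1 and belongs to a different user,
    so the L2 sensitivity of C @ x is proportional to the largest column norm of C.
    """
    col_norm = np.linalg.norm(C, axis=0).max()
    z = rng.normal(0.0, sigma * col_norm, size=(C.shape[0], x.shape[1]))
    return B @ (C @ x + z)


rng = np.random.default_rng(0)
x = rng.uniform(-1.0, 1.0, size=(8, 3))
x /= np.maximum(1.0, np.linalg.norm(x, axis=1, keepdims=True))  # enforce norm <= 1

A = running_mean_workload(len(x))
B, C = prefix_sum_factorization(len(x))
noisy_means = mf_release(x, B, C, sigma=1.0, rng=rng)  # one noisy estimate per step
```

The other obvious choice, `B = A` with `C` equal to the identity, corresponds to adding noise directly to the inputs. Which factorization gives lower error depends on the workload and on how users participate, which is the question the abstract addresses.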

artificial intelligence, factorization, machine learning, (13 more...)

arXiv.org Machine Learning

2601.2232

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > Denmark > Capital Region > Copenhagen (0.04)
Europe > Austria (0.04)

Genre: Research Report (0.49)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.45)

Differentially Private Model Personalization

Neural Information Processing Systems, Dec-25-2025, 07:30:59 GMT

We study personalization of supervised learning with user-level differential privacy. Consider a setting with many users, each of whom has a training data set drawn from their own distribution $P_i$. Assuming some shared structure among the problems $P_i$, can users collectively learn the shared structure---and solve their tasks better than they could individually---while preserving the privacy of their data? We formulate this question using joint, user-level differential privacy---that is, we control what is leaked about each user's entire data set. We provide algorithms that exploit popular non-private approaches in this domain like the Almost-No-Inner-Loop (ANIL) method, and give strong user-level privacy guarantees for our general approach. When the problems $P_i$ are linear regression problems with each user's regression vector lying in a common, unknown low-dimensional subspace, we show that our efficient algorithms satisfy nearly optimal estimation error guarantees. We also establish a general, information-theoretic upper bound via an exponential mechanism-based algorithm.

differentially private model personalization, name change, user-level differential privacy, (2 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

User-Level Differential Privacy With Few Examples Per User

Neural Information Processing Systems, Dec-24-2025, 19:36:23 GMT

Previous work on user-level differential privacy (DP) [Ghazi et al. NeurIPS 2021, Bun et al. STOC 2023] obtained generic algorithms that work for various learning tasks. However, their focus was on the example-rich regime, where the users have so many examples that each user could themselves solve the problem. In this work we consider the example-scarce regime, where each user has only a few examples. For approximate-DP, we give a generic transformation of any item-level DP algorithm to a user-level DP algorithm. Roughly speaking, the latter gives a (multiplicative) savings of $O_{\varepsilon,\delta}(\sqrt{m})$ in terms of the number of users required for achieving the same utility, where $m$ is the number of examples per user. This algorithm, while recovering most known bounds for specific problems, also gives new bounds, e.g., for PAC learning.

algorithm, name change, user-level differential privacy, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.80)

12036_mean_estimation_with_user_leve

Audra McMillan

Neural Information Processing Systems, Aug-18-2025, 07:45:35 GMT

Thus, we are running the estimator promised in Lemma F.2 on

artificial intelligence, inequality follow, variance, (14 more...)

Neural Information Processing Systems

Industry: Information Technology > Security & Privacy (0.67)

Technology:

Information Technology > Security & Privacy (0.67)
Information Technology > Artificial Intelligence (0.46)

Differentially Private Model Personalization

Neural Information Processing Systems, Jan-19-2025, 14:30:22 GMT

We study personalization of supervised learning with user-level differential privacy. Consider a setting with many users, each of whom has a training data set drawn from their own distribution $P_i$. Assuming some shared structure among the problems $P_i$, can users collectively learn the shared structure---and solve their tasks better than they could individually---while preserving the privacy of their data? We formulate this question using joint, user-level differential privacy---that is, we control what is leaked about each user's entire data set. We provide algorithms that exploit popular non-private approaches in this domain like the Almost-No-Inner-Loop (ANIL) method, and give strong user-level privacy guarantees for our general approach.

algorithm, differentially private model personalization, user-level differential privacy

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

User-Level Differential Privacy With Few Examples Per User

Neural Information Processing Systems, Oct-11-2024, 11:22:19 GMT

Previous work on user-level differential privacy (DP) [Ghazi et al. NeurIPS 2021, Bun et al. STOC 2023] obtained generic algorithms that work for various learning tasks. However, their focus was on the example-rich regime, where the users have so many examples that each user could themselves solve the problem. In this work we consider the example-scarce regime, where each user has only a few examples. For approximate-DP, we give a generic transformation of any item-level DP algorithm to a user-level DP algorithm. Roughly speaking, the latter gives a (multiplicative) savings of $O_{\varepsilon,\delta}(\sqrt{m})$ in terms of the number of users required for achieving the same utility, where $m$ is the number of examples per user. This algorithm, while recovering most known bounds for specific problems, also gives new bounds, e.g., for PAC learning.

algorithm, dp algorithm, user-level differential privacy, (1 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.85)

User-Level Differential Privacy With Few Examples Per User

Ghazi, Badih, Kamath, Pritish, Kumar, Ravi, Manurangsi, Pasin, Meka, Raghu, Zhang, Chiyuan

arXiv.org Artificial Intelligence, Sep-21-2023

Previous work on user-level differential privacy (DP) [Ghazi et al. NeurIPS 2021, Bun et al. STOC 2023] obtained generic algorithms that work for various learning tasks. However, their focus was on the example-rich regime, where the users have so many examples that each user could themselves solve the problem. In this work we consider the example-scarce regime, where each user has only a few examples, and obtain the following results: 1. For approximate-DP, we give a generic transformation of any item-level DP algorithm to a user-level DP algorithm. Roughly speaking, the latter gives a (multiplicative) savings of $O_{\varepsilon,\delta}(\sqrt{m})$ in terms of the number of users required for achieving the same utility, where $m$ is the number of examples per user. This algorithm, while recovering most known bounds for specific problems, also gives new bounds, e.g., for PAC learning. 2. For pure-DP, we present a simple technique for adapting the exponential mechanism [McSherry, Talwar FOCS 2007] to the user-level setting. This gives new bounds for a variety of tasks, such as private PAC learning, hypothesis selection, and distribution learning. For some of these problems, we show that our bounds are near-optimal.
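For readers unfamiliar with the exponential mechanism referenced in this abstract, here is a minimal sketch of its standard form: a candidate is sampled with probability proportional to `exp(epsilon * score / (2 * sensitivity))`. This is the textbook mechanism only, not the user-level adaptation the paper describes. The function name and the example scores are hypothetical.

```python
import numpy as np


def exponential_mechanism(scores, epsilon, sensitivity, rng):
    """Sample an index with probability proportional to exp(epsilon * score / (2 * sensitivity)).

    `sensitivity` is the most any score can change between neighboring datasets;
    what counts as "neighboring" (one item vs. one user) is up to the caller.
    """
    scores = np.asarray(scores, dtype=float)
    logits = epsilon * scores / (2.0 * sensitivity)
    logits -= logits.max()  # subtract the max for numerical stability
    probs = np.exp(logits)
    probs /= probs.sum()
    return int(rng.choice(len(scores), p=probs)), probs


rng = np.random.default_rng(0)
# Hypothetical utility scores for three candidate hypotheses.
choice, probs = exponential_mechanism([3.0, 5.0, 4.0], epsilon=1.0, sensitivity=1.0, rng=rng)
```

Candidates with higher scores are exponentially more likely to be selected, and a smaller `epsilon` flattens the distribution toward uniform.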

user-level differential privacy

arXiv.org Artificial Intelligence

2309.125

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.73)

Mean Estimation with User-level Privacy under Data Heterogeneity

Cummings, Rachel, Feldman, Vitaly, McMillan, Audra, Talwar, Kunal

arXiv.org Artificial Intelligence, Jul-28-2023

A key challenge in many modern data analysis tasks is that user data are heterogeneous. Different users may possess vastly different numbers of data points. More importantly, it cannot be assumed that all users sample from the same underlying distribution. This is true, for example, in language data, where different speech styles result in data heterogeneity. In this work we propose a simple model of heterogeneous user data that allows user data to differ in both distribution and quantity of data, and provide a method for estimating the population-level mean while preserving user-level differential privacy. We demonstrate asymptotic optimality of our estimator and also prove general lower bounds on the error achievable in the setting we introduce.
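To make the user-level setting concrete, the sketch below shows a common baseline: each user's data is first collapsed into a single per-user mean, that mean is clipped to a fixed L2 radius, and Gaussian noise is added to the average of the clipped means. This is not the estimator from the paper, which weights users to account for heterogeneity. The function name, `clip_norm`, and `noise_multiplier` are illustrative, and the mapping from `noise_multiplier` to an $(\varepsilon, \delta)$ guarantee is deliberately left out.

```python
import numpy as np


def private_user_level_mean(user_data, clip_norm, noise_multiplier, rng):
    """Baseline user-level mean: clip each user's mean, average, add Gaussian noise.

    user_data: list of arrays, one per user, each of shape (m_i, d); m_i may differ.
    Clipping bounds any single user's influence on the average by clip_norm / n,
    regardless of how many points that user holds.
    """
    user_means = np.stack([np.mean(np.asarray(d, dtype=float), axis=0) for d in user_data])
    norms = np.linalg.norm(user_means, axis=1, keepdims=True)
    scale = np.minimum(1.0, clip_norm / np.maximum(norms, 1e-12))
    clipped = user_means * scale
    n = len(user_data)
    noise = rng.normal(0.0, noise_multiplier * clip_norm / n, size=clipped.shape[1])
    return clipped.mean(axis=0) + noise


rng = np.random.default_rng(0)
# Three hypothetical users holding different numbers of 2-dimensional points.
users = [rng.normal(0.5, 1.0, size=(m, 2)) for m in (1, 5, 50)]
estimate = private_user_level_mean(users, clip_norm=2.0, noise_multiplier=1.0, rng=rng)
```

Because every user is weighted equally here, a user with one point counts as much as a user with fifty, which is one reason heterogeneous data calls for a more careful estimator.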

data mining, machine learning, variance, (18 more...)

arXiv.org Artificial Intelligence

2307.15835

Country: